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1 Connputing graphical queries over XML data 
Sara Comai, Ernesto DamianI, Piero Fraternali 



October 2001 ACM Transactions on Information Systems (TOIS), volume 19 issue 4 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: ^ pdf(707.80 KB) 



The rapid evolution of XML from a mere data exchange format to a universal syntax for 
encoding domain-specific information raises the need for new query languages specifically 
conceived to address the characteristics of XML. Such languages should be able not only 
to extract information from XML documents, but also to apply powerful transformation 
and restructuring operators, based on a well-defined semantics. Moreover, XML queries 
should be natural to write and understand, as nontechnical person ... 



Keywords: Document restructuring, graphical query languages, semantics 



Streaming XML: Stream processing of XPath queries with predicates 
Ashish Kumar Gupta, Dan Suciu 

June 2003 Proceedings of the 2003 ACI^ SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available- 151 pdf(46460KB) Additional Information: full citation, abstract , references , citings, index 
. |Aj.kL_j : terms 

We consider the problem of evaluating large numbers of XPath filters, each with many 
predicates, on a stream of XML documents. The solution we propose is to lazily construct 
a single deterministic pushdown automata, called the XPush Machine from the given 
XPath filters. We describe a number of optimization techniques to mal<e the lazy XPush 
machine more efficient, both in terms of space and time. The combination of these 
optimizations results in high, sustained throughput. For example, if ... 

Path shariri g and predicate evaluation for hi g h-performance XML filterin a 
Yanlei Diao, Mehmet Altinel, Michael J. Franklin, Hao Zhang, Peter Fischer 
December 2003 ACM Transactions on Database Systems (TODS), volume 28 issue 4 

Publisher: ACM Press 

Full text available: ^ pdf (543.40 KB) Additional Information: full citation , abstract , references , dtings, index 
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XML filtering systems aim to provide fast, on-the-fly matching of XML-encoded data to 
large numbers of query specifications containing constraints on botli structure and 
content. It is now well accepted that approaches using event-based parsing and Finite 
State Machines (FSMs) can provide the basis for highly scalable structure-oriented XML 
filtering systems. The XFilter system [Altinel and Franklin 2000] was the first published 
FSM-based XML filtering approach. XFilter used a separate FSM per pa ... 

Keywords: Nondeterministic Finite Automaton, XML filtering, content-based matching, 
nested path expressions., path sharing, predicate evaluation, structure matching 



Efficient wire fornaats for high performance connputin g | 

Fabian E. Bustamante, Greg Eisenhauer, Karsten Schwan, Patrick Widener 

November 2000 Proceedings of the 2000 ACM/IEEE conference on Supercomputing 

(CDROM) 
Publisher: IEEE Computer Society 

Full text available: 'S.|3df( 134.14 KB ). Additional Infomiatlon: futi citation , abstract , references , citings, index 
W Publisher Site 

High performance computing is being increasingly utilized in non-traditional circumstances 
where it must interoperate with other applications. For example, online visualization is 
being used to monitor the progress of applications, and real-worid sensors are used as 
inputs to simulations. Whenever these situations arise, there is a question of what 
communications Infrastructure should be used to link the different components. 
Traditional HPC-style communications systems such as MPI offer r ... 

Queue Focus: Nine IM Accounts and Countin g 
Joe Hildebrand 

November 2003 Queue, volume l issue 8 
Publisher: ACM Press 

Full text available: ■@.pm52.MB)„!U Additional information: fu il citation , in dex terms 
html(22.25 KB) 



6 Packrat parsing:: sinnple. pov\/erfuL lazy, linear tinne. functional peari 
Bryan Ford 

September 2002 ACM SIGPLAN Notices , Proceedings of the seventh ACM SIGPLAN 

international conference on Functional programming ICFP '02, volume 

37 Issue 9 
Publisher: ACIVI Press 

r- nx ^ -. u. 01 c-7 i^Dx Additional Information: full citation , abstract, references , citings, index 

Full text available: TO pdf(171.57 KB) 

terms 

Packrat parsing is a novel technique for implennenting parsers in a lazy functional 
programming language. A packrat parser provides the power and flexibility of top-down 
parsing with backtracking and unlimited lookahead, but nevertheless guarantees linear 
parse time. Any language defined by an LL(/c) or LR(/c) grammar can be recognized by a 
packrat parser, in addition to many languages that conventional linear-time algorithms do 
not support. This additional power simplifies the handii ... 

Keywords: Haskell, backtracking, lexical analysis, memolzation, parser combinators, 
scannerless parsing, top-down parsing 
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Database session 6: XML: Light-weight xPath processing of XML stream with 
deterministic automata 
Makoto Onizuka 

November 2003 Proceedings of the twelfth international conference on Information 
and knowledge management 

Publisher: ACM Press 

I- I. * ^ I ui fiit ^*/c-7ft -7c i^Dx Additional Information: full citation , abstract , references , citings, index 

Full text available: TO pdf(579.76 KB) ^ 

terms 

Several applications based on XML stream processing have recently emerged, such as 
those for air traffic control and the selective dissemination of information (SDI). Their 
common need is to process a large number of XPath expressions in continuous XML 
streams at high throughput.This paper proposes four techniques for XPath expression 
processing based on Deterministic Finite Automata (DFA) for two purposes: to improve 
the memory usage efficiency of the automata and to support the processing of b ... 

Keywords: XPath processing, automata, selective dissemination of Information, 
streaming XML 



An XML query en g ine for network-bound data 
Zachary G. Ives, A. Y. Halevy, D. S. Weld 

December 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, volume 11 Issue 4 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^pdf(351.86 KB) Additional Information: full citation , abstract , citings , index terms 

XML has become the lingua franca for data exchange and integration across 
administrative and enterprise boundaries. Nearly all data providers are adding XML import 
or export capabilities, and standard XML Schemas and DTDs are being promoted for all 
types of data sharing. The ubiquity of XML has removed one of the major obstacles to 
integrating data from widely disparate sources - namely, the heterogeneity of data 
formats. However, general-purpose Integration of data across the wide are a also re ... 

Keywords: Data integration. Data streams, Query processing, Web and databases, XML 



M esh-base d cont ent routin g usin g XML I 
Alex C. Snoeren, Kenneth Conley, David K. Gifford 

October 2001 ACM SZGOPS Operating Systems Review , Proceedings of the eighteenth 
ACM symposium on Operating systems principles SOSP '01, volume 35 issue 

5 

Publisher: ACM Press 

Full text available: pdf(1.24 MB) Additional information: full citation , abstract, references , dtiogs. index 

We fiave developed a new approach for reliably multicasting time-critical data to 
heterogeneous clients over mesh-based overlay networks. To facilitate intelligent content 
pruning, data streams are comprised of a sequence of XML packets and forwarded by 
application-level XML routers. XML routers perform content-based routing of individual 
XML packets to other routers or clients based upon queries that describe the information 
needs of downstream nodes. Our PC-based XML router prototype can route ... 

10 Streaming XML: XPath queries on streaming data 
^ Feng Peng, Sudarshan S. Chawathe 

June 2003 Proceedings of the 2003 ACM SIGI40D international conference on 
Management of data 
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We present the design and implementation of the XSQ system for querying streaming 
XML data using XPath 1.0. Using a clean design based on a hierarchical arrangement of 
pushdown transducers augmented with buffers, XSQ supports features such as multiple 
predicates, closures, and aggregation. XSQ not only provides high throughput, but is also 
memory efficient: It buffers only data that must be buffered by any streaming XPath 
processor. We also present an empirical study of the performance character ... 

11 Optimizing the lazy DFA approach for XML stream processing Q 
Danny Chen, Raymond K. Wong 

January 2004 Proceedings of the fifteenth conference on Australasian database - 
Volume 27 CRPIT '04 

Publisher: Australian Computer Society, Inc. 

Full text available: ^pdf d 90.92 KB ) Additional Information: full citation , abstract , references 

Lazy DFA (Deterministic Finite Automata) approach has been recently proposed to for 
efficient XML stream data processing. This paper discusses the drawbacks of the 
approach, suggests several optimizations as solutions, and presents a detailed analysis for 
the processing model. The experiments show that our proposed approach is indeed 
effective and scalable. 

Keywords: XML, XPath, lazy DFA, stream data 

12 Context-based prefetch - an optimization for implementing objects on relations Q 
Philip A. Bernstein, Shankar Pal, David Shutt 

December 2000 The VLDB Journal — The International Journal on Very Large Data 

Bases, volume 9 Issue 3 

Publisher: Springer-Verlag New York. Inc. 

Full text available: ^pdf (142.24 KB) Additional Information: full citation , abstract , index terms 

When implementing persistent objects on a relational database, a major performance 
issue is prefetching data to minimize the number of round-trips to the database. This is 
especially hard with navigational applications, since future accesses are unpredictable. We 
propose the use of the context in which an object is loaded as a predictor of future 
accesses, where a context can be a stored collection of relationships, a query result, or a 
complex object. When an object O's state is loaded, similar ... 

Keywords: Caching, Object-oriented database. Object-relational mapping. Prefetch 

13 Al gorithms and programming models for efficient representation of XML for Internet Q 
^ ap plications 

^ Neel Sundaresan, Reshad Moussa 

April 2001 Proceedings of the lOth international conference on World Wide Web 

Publisher: ACM Press 

Full text available: "g) pdf(352.97 KB) Additional Infonmatlon: full citation , references , citings, index terms 



Keywords: DOM, SAX, WBXML, XML, compression 
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>^ Leonidas Fegaras, David Levine, Sujoe Bose, Vamsi Chaluvadi 

^ November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 
Publisher: ACM Press 

I- II * ^ -I ui A ^ti^Aa izc^ lyox Additional Information: full citation , abstract , references , citings, index 
Full text available: T5j pqt(24d.5o kb) 

We are addressing the efficient processing of continuous XML streams, In which the server 
broadcasts XML data to multiple clients concurrently through a multicast data stream, 
while each client is fully responsible for processing the stream. In our framework, a server 
may disseminate XML fragments from multiple documents in the same stream, can repeat 
or replace fragments, and can introduce new fragments or delete invalid ones. A client 
uses a light-weight database based on our proposed XML alge ... 

Keywords: XML, databases, query optimization, query processing 

15 Visualisin g reusable software over the web | 
Stuart Marshall, Kirk Jackson, Robert Biddle, Michael McGavin, Ewan Tempero, Matthew 
Duignan 

Decennber 2001 Australian symposium on Information visualisation - Volume 9 
CRPITS '01 

Publisher: Australian Computer Society, Inc. 

I- 11*^ -I ui 01 oo ^/lD^ Additional Information: full citation , abstract , references , citings , index 
Full text available: TO pdf 1 38 MB) ^ 

This paper describes an architecture we have developed for web-based visualisation of 
remotely executing software. The motivation for this worl< Is to allow users of web-based 
software repositories to explore existing code components and frame-works, to see what 
they do, and create interactive visual documentation of that code based on the 
developer's actions. This visual documentation can be used to determine what the code or 
framework does, how it does it, and whether it can be reused In ... 

Keywords: code reuse, software visualisation, web-based code repositories 



16 XML linkin g 
Steven J. DeRose 

December 1999 ACM Computing Surveys (CSUR) 
Publisher: ACM Press 

Full text available: ^ pdf(1 54.81 KB) Additional Information: full citation, references, citing s, index terms 



17 Industry session 1: information retrieval: XML parsing: a threat to database 
^ performance 

Matthias Nicola, Jasmi John 

November 2003 Proceedings of the twelfth international conference on Information 

and knowledge management 
Publisher: ACM Press 



eii*^ •• ui 0 oa UD\ Additional Information: full citation , abstract, references , citings, index 

Full text available: Tk\ p aT (2io.oD kb ; 

terms 

XML parsing is generally known to have poor performance characteristics relative to 
transactional database processing. Yet, its potentially fatal impact on overall database 
performance is being underestimated. We report real-word database applications where 
XML parsing performance is a key obstacle to a successful XML deployment. There Is a 
considerable share of XML database applications which are prone to fail at an early and 
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simple road block: XML parsing. We analyze XML parsing performance an ... 
Keywords: DOM, SAX, XML, database, parser, performance, validation 

18 Research centers: Database research at UT Arlington ||| 
Sharma Chakravarthy, Alp Aslandogan, Ramez ElmasrI, Leonldas Fegaras, JungHwan Oh 
March 2003 ACM SIGMOD Record, volume 32 issue 1 

Publisher: ACM Press 

Full text available: ^ pdf(91 .35 KB) Additional Information: full citation 



19 Virtual extension: A comparison of B2B e-service solutions 
Dan Jong Kim, Manish Agrawal, Bharat Jayaraman, H. Raghav Rao 
December 2003 Communications of tlie ACM, volume 46 issue 12 

Publisher: ACM Press 

Full text availab!e:g.p^2M^^ ^^^.^.^„3, (.formation: full citation , references , index terms 
g1 html(25.71 KB) 



20 Industry session 1: information retrieval: Managing IFC for civil enqineering projects Q 

^ Renaud Vanlande, Christophe Cruz, Christophe Nicolle 

>^ November 2003 Proceedings of tlie twelftli international conference on Information 
and Icnowledge management 

Publisher: ACM Press 

Full text available:^ pdf(255.03 KB) Additional Information: full citation , abstract , references , index terms 

The "Industrial Foundation Classes" (IFC) are an ISO norm to define all components of a 
building in a civil engineering project. IFC files are textual files whose size can reach 100 
megabytes. Several IFC files can coexist on the same civil engineering project. Due to 
their size, their handling and sharing is a complex task. In this paper, we present an 
approach to automatically identify business objects in the IFC files and simplify their 
visualization and manipulation on the Internet. We const ... 

Keywords: 3D, DBMS, IFC, XML, collaborative application, semantic 
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21 The University of New Brunswick's pilot for an electronic theses and dissertation | 

prog ram 

Janice El-Bayoumi, Lisa Charlong 

September 2003 Proceedings of the 31st annual ACM SZGUCCS conference on User 

services 
Publisher: ACM Press 

Full text available: 'g| p df (215.37 KB) Additional Infonmation: full citation , abstract , references , index terms 

In November 2002, the University of New Brunswick and UNB's Graduate Student 
Association (GSA) began an Electronic Thesis and Dissertation (ETD) pilot program. An 
ETD is an electronically published thesis or dissertation. By publishing ETDs on the web, a 
student's work is easily and quickly accessible to the research community. UNB's pilot was 
based on information obtained from the National Digital Library of Theses and 
Dissertations, discussions with universities participating in ETD programs, ... 

Keywords: ETD, digital dissertations, dissertation, electronic publishing, thesis 



22 Technical correspondence: XVF: C++ introsoection by extensible visitation Q 
^ Kurt Stephens 

V August 2003 ACM SIGPLAN Notices, volume 38 issue 8 
Publisher: ACM Press 

Full text available: 'g|pdf ( 144.43 KB) Additional Information: full citation , abstract , references 

Object serialization and object inspector user interfaces are concerns that can be 
orthogonally implemented using introspection via meta-object protocols (MOP). The C++ 
language lacks a formal meta-object protocol, although some are available as source pre- 
processors. A full MOP is not necessary for many classes of problems where basic 
introspection is useful. This paper describes a technique: extensible visitation of objects 
as a basic introspection primitive. 

23 The view selection problem for XML content based routing Q 
^ Ashish Kumar Gupta, Dan Suclu, Alon Y. Halevy 

^ June 2003 Proceedings of the twenty-second ACM SIGMOD-SZGACT-SXGART 
symposium on Principles of database systems 

Publisher: ACM Press 

^ , ^ ., . , Additional Information: full citation , abstract , references , citings , index 
Full text available: 
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We consider the view selection problem for XML content based routing: given a networl<, 
in which a stream of XI^L documents is routed and the routing decisions are tal<en based 
on results of evaluating XPath predicates on these documents, select a set of views that 
maximize the throughput of the network. While in view selection for relational queries the 
speedup comes from eliminating joins, here the speedup is obtained from gaining direct 
access to data values in an XML pacl<et, without parsing tha ... 

24 Database issues for event-based middleware: MJoin: a metadata-aware stream join Q 
o perator 

Luping Ding, Ell<e A. Rundensteiner, George T. Heineman 

June 2003 Proceedings of the 2nd international workshop on Distributed event- 
based systems 

Publisher: ACM Press 

Full text available: ^ pdf(229.21 KB) Additional Information: full citation, abstract , references 

Join algoritlims must be re-designed when processing stream data instead of persistently 
stored data. Data streams are potentially infinite and the query result is expected to be 
generated incrementally instead of once only. Data arrival patterns are often 
unpredictable and the statistics of the data and other relevant metadata often are only 
known at runtime. In some cases they are supplied interleaved with the actual data in the 
form of stream markers. Recently, stream join algorithms, like Sym ... 

Keywords: Metadata, XML Stream, XQuery Subscription, constraint, join algorithms, 
optimization 




25 The Next Bang: The Explosive Combination of Embedded Linux, XML and Instant Q 

Messaging 
Doc Searls 

September 2000 Linux Journal 

Publisher: Specialized Systems Consultants, Inc. 

Full text available: [g| htmlf34.52 KB) Additional Information: full citation , references , index terms 



26 XML transactions: Efficient synchronization for mobile XML data Q 
^ Franky Lam, Nicole Lam, Raymond Wong 

^ November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Publisher: ACM Press 

Full text available: ^ pdf(1 16.31 KB) Additional Information: full citation , abstract , references , index terms 

Many handheld applications receive data from a primary database server and operate in 
an intermittently connected environment these days. They maintain data consistency with 
data sources through sychronization. In certain applications such as sales force 
automation, it is highly desirable if updates on the data source can be reflected at the 
handheld applications immediately. This paper proposes an efficient method to 
synchronize XML data on multiple mobile devices. Each device retrieves and cac ... 

Keywords: XML, information dissemination, information subscription, path containment 
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Eril< Duval, Eddy Forte, Kris Cardinaels, Bart Verhoeven, Rafael Van Durm, Koen Hendrikx, 
Maria Wentland Forte, Norbert Ebel, Maciej Macowicz, Ken Warkentyne, Florence HaennI 
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May 2001 Communications of the ACM, volume 44 issue 5 
Publisher: ACM Press 

Full text available: mmmMm Additional Information: full citation, references , citings , index terms 
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28 Database session 5: management of data streams: Raindrop: a uniform and layered I I 
^ algebraic framework for XQueries on XML streams 
^ Hong Su, JInhui Jian, EIke A. Rundensteiner 

November 2003 Proceedings of the twelfth international conference on Information 
and knowledge management 

Publisher: ACM Press 

Full text available: "g) pdf(705.69 KB) Additional Infonmation: full citation, abstract , references , index terms 

XML stream applications bring the challenge of efficiently processing queries on 
sequentially accessible token-based data. While the automata model is naturally suited for 
pattern matching on tokenized XML streams, the algebraic model In contrast is a well- 
established technique for set-oriented processing of self-contained tuples. However, 
neither automata nor algebraic models are well-equipped to handle both computation 
paradigms. 



The goal of the Raindrop project is t ... 

Keywords: XML stream, XQuery algebra, query processing 



29 Technical correspondence: Object serialization analysis and comparison in Java Q 
^ and .NET 

^ Marjan Hericko, Matjaz B. Juric, Ivan Rozman, Simon Beloglavec, Ales Zivkovic 
August 2003 ACM SIGPLAN Notices, volume 38 issue 8 
Publisher: ACM Press 

Full text available: ^ pdf(339.96 KB ) Additional Information: full citation , abstract , references , citings 

This article compares binary and XML object serialization on Java and Microsoft .NET 
platforms from the performance and size perspective. It uses three different types of 
objects and different number of objects to make a comparison which reflects real-world 
circumstances. The article has the following contributions: (1) it compares binary and XML 
serialization between Java and .NET to compare the efficiency of both platforms; (2) it 
compares binary and XML serialization within the platforms to c ... 

Keywords: .NET, Binary, Java, Serialization, XML 



30 Video anywhere: a system for searching and managing distributed hetero geneous 
video assets 

Amit Sheth, Clemens Bertram, Kshitij Shah 
March 1999 ACM SIGMOD Record, volume 28 issue 1 

Publisher: ACM Press 

Full text available: ^ pdf(545.62 KB) Additional Information: full citation , abstract , index terms 

Visual information, especially videos, plays an increasing role In our society for both work 
and entertainment as more sources become available to the user. Set-top boxes are 
poised to give home users access to videos that come not only from TV channels and 
personal recordings, but also from the Internet in the form of downloaded and streaming 
videos of various types. Current approaches such as Electronic Program Guides and video 
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February 2001 Linux Journal 

Publisher: Specialized Systems Consultants, Inc. 
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The "Next Bang" prophecy fulfilled. 
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CORPORATE Linux Journal Staff 

September 2001 Linux Journal, volume 2001 issue 89 
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A unified framework for data translation over the Web 

Torlone, R.; AtzenI, P.; 
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Data Engineering, 2004. Proceedings. 20th International Conference on 
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